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ABSTRACT 

Weak gravitational lensing studies of galaxy clusters often assume a spherical cluster model 
to simplify the analysis, but some recent studies have suggested this simplifying assumption 
may result in large biases in estimated cluster masses and concentration values, since clusters 
are expected to exhibit triaxiality. Several such analyses have, however, quoted expressions 
for the spatial derivatives of the lensing potential in triaxial models, which are open to misin- 
terpretation. In this paper, we give a clear description of weak lensing by triaxial NFW galaxy 
clusters and also present an efficient and robust method to model these clusters and obtain 
parameter estimates. By considering four highly triaxial NFW galaxy clusters, we re-examine 
the impact of the simplifying assumption of sphericity and find that while the concentra- 
tion estimates are largely unbiased, except in one of our triaxial NFW simulated clusters, the 
masses are significantly biased, by up to 40%, for all the clusters we analysed. Moreover, we 
find that erroneously assuming spherical symmetry can lead to the mistaken conclusion that 
some substructure is present in the galaxy clusters or, even worse, that multiple galaxy clus- 
ters are present in the field. Our cluster fitting method also allows one to answer the question 
of whether a given cluster exhibits triaxiality or a simple spherical model is good enough. 
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1 INTRODUCTION 

Clusters of galaxies are the most massive gravitationally bound ob- 
jects in the universe and, as such, are critical tracers of the for- 
mation of large-scale structure. The number counts of clusters as 
a function of their mass and redshift have been predicted both an- 
alytically (see e.g. lPress & Schechter|[l974l : ISheth et alj|200lh and 
from large scale nu merical simulations (see e.g. ljenkins et al . 2001 ; 
lEvrard et alj|2002h , and are particular ly sensitive to the cosm olog- 
ical parameters as and Sl m (see e.g. iBattve &~W eller 2003j). The 
size and formation history of massive clusters is such that the ra- 
tio of gas mass to total mass is expected to be representative of the 
universal ratio Jlb/^m, once the relatively small amount of bary- 
onic matter in th e cluster galaxies is taken into account (see e.g. 
IWhiteetal.lll993h . 

The study of cosmic shear has progressed rapidly in recent 
years, with the availability of high quality wide-field lensing data. 
Large dedicated surveys with ground-based telescopes have been 
employed to reconstruct the mass distribution of the universe and 
const r ain cosmological pa rameters (see e.g. iMassev et al.l 120071 . 
120051 : iHoekstra et al1 l2006). Weak lensing also allows one to de- 
tect galaxy clusters without making any assumptions about the 
baryon fraction, richness, morphology or dynamical state of the 
cluster, and so weak lensing cluster modelling would allow one to 
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test these assumptions by observing the cluster with optical, X-ray 
or Sunyaev-Zel'dovich (SZ) methods. Nonetheless, weak lensing 
studies are very challenging, not only because the data are very 
noisy, but also because of the inherent under-determined nature of 
the problem, namely constraining the 3-D structure of the galaxy 
cluster using lensing observations that are sensitive only to the 2-D 
projected mass distribution, at least in the thin lens approximation. 

A model cluster density profile can be determined from nu- 
merical A-body simulations of large-scale structure formation in a 
ACDM unive rse. In particular, as suming spherical symmetry, the 
NFW profile dNavarro et ai1ll997h provides a good fit to the simu- 
lations. The NFW profile is parameterized by the virial mass M200 
and the concentration parameter C, which are highly-correlated. 
The concentration parameter is directly related to the central den- 
sity of the cluster, which depends on cluster's formation history. 
Since galaxy clusters are the most recently bound objects in the 
universe, the cluster concentrations serve as probes of the mean 
density of the universe at lower redshifts. A-body simulations pre- 
dict C ~ 4 for a cluster of mass M 20 o = W^h^MQ. sev- 
eral studies, assuming a spherical NFW model, have reported weak 
lens ing results with very high concentration values of C ~ 8 (se e 
e.g. iLimousin etai]|2007l : iKneib et alj|2003l : iGavazzi et ai1l2003h . 
casting doubt on the ACDM model. In order to shed further light 
on this discrepancy, it is important to understand to what extent 
can these results be explained by deviations from spherical symme- 
try. Another reason for studying the triaxiality of galaxy clusters is 
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that several studies on structure formation in AC DM suggest clus - 
ters often exhibit significant triaxiality (see e.g. iBett et afl d2007l) ; 
IShawetalJfioO^ ). 

Several attempts have already been made towards understand- 
ing the impact of triaxiality in explaining the devi ations from 
ACDM of the observed gravitational lensing results. Ijing & Sutol 
(2002) first presented a full parameterizatio n for a triaxial NFW 
halo. lOguri et al.l d2005h and (Gavazzll J2005h showed that a triax- 
ial NFW fit to gravitational lensing observations of clusters Abell 
1689 and MS2 137-2 3 yields parame t er esti mates consistent with 
A^-bod y simulation. ICorless & Kingl 120071) and ICorless & King 
(2008) presented a general Markov Chain Monte Carlo (MCMC) 
method for fitting a triaxial NFW model to weak lensing observa- 
tions. From the analysis of simulated lensing observations of triax- 
ial clusters, they concluded that significant elongation of the cluster 
along the line of sight can cause its mass to be overestimated by up 
to 50 per cent and its concentration by as much as a factor of 2, 
if one simply assumes a spherical NFW model when analysing its 
lensing signal. This claim is important since it would significantly 
ease the tension between the ACDM model and the very high val- 
ues of the concentration parameter reported in a number of clus- 
ters, derived from lensing observations assuming a spherical NFW 

model. 

The triaxial lensing equati ons quoted in ICorless &^ Kine 

d2007l) and lCorless & Kind ( 2008) adopted the triaxial NFW model 
equations from lKeetonl ( 1200 lh . which can easily be misinterpreted, 
and also had some additional errors, although the correct form of 
equations were used in their analysis code (King, private commu- 
nication). In any case, our purpose in this paper is to present a 
clear description of the weak lensing equations for the triaxial NFW 
model and simultaneously introduce an efficient, robust and accu- 
rate method, based on nested sampling, for fitting triaxial NFW 
model to weak lensing observations. In particular, we study the 
discrepancies that may arise in fitting a spherical NFW model to 
a galaxy cluster exhibiting significant triaxiality. 

The outline of this letter is as follows. In Sec. [2] we introduce 
Bayesian inference, which we use for cluster fitting. In Sec. [3] we 
describe the triaxial NFW model and derive the relevant equations. 
We apply our method to a set of simulated weak lensing simulations 
in Sec. [4] Finally we present our conclusions in Sec. [5] 



2 BAYESIAN INFERENCE 

Our cluster fitting methodology is built upon the principles of 
Bayesian inference, which provides a consistent approach to the 
estimation of a set of parameters © in a model (or hypothesis) H 
for the data D. Bayes' theorem states that 



Pr(0|D,H) 



Pr(D| &,H)Pt(&\H) 
Pr(D|tf) ' 



(1) 



where Pr(0jD,i/) = P(0) is the posterior probability distri- 
bution of the parameters, Pr(D|0, H) = £■(&) is the likelihood, 
Pr(®\H) = 7r(0) is the prior, and Pr(DjH) = Z is the Bayesian 
evidence. 

In parameter estimation, the normalising evidence factor is 
usually ignored, since it is independent of the parameters ©, and 
inferences are obtained by taking samples from the (unnormalised) 
posterior using standard MCMC sampling methods, where at equi- 
librium the chain contains a set of samples from the parameter 
space distributed according to the posterior. This posterior consti- 
tutes the complete Bayesian inference of the parameter values, and 



can be marginalised over each parameter to obtain individual pa- 
rameter constraints. 

In contrast to parameter estimation problems, for model se- 
lection the evidence takes the central role and is simply the factor 
required to normalize the posterior over 0: 



Z = / £(0)7r(0)d D 0, 



(2) 



where D is the dimensionality of the parameter space. As the av- 
erage of the likelihood over the prior, the evidence is larger for a 
model if more of its parameter space is likely and smaller for a 
model with large areas in its parameter space having low likeli- 
hood values, even if the likelihood function is very highly peaked. 
Thus, the evidence automatically implements Occam's razor. The 
question of model selection between two models Ho and Hi can 
then be decided by comparing their respective posterior probabili- 
ties given the observed data set D, as follows 



R : 



Pr(#i|D) _ Pr(D|#i)Pr(i/i) _ Z 1 Pr(ffi) 



Pr(#o|D) Pr(D|# )Pr(#o) Z Pr(H ) 



(3) 



where Pr(Hi)/ Pr(Ho) is the a priori probability ratio for the two 
models, which can often be set to unity but occasionally requires 
further consideration. 

Evaluation of the multidimensional integral in (2) is a chal- 
lenging numerical task. Standard techniques like thermodynamic 
integration are extremely computationally intensive which makes 
evidence evaluation at least an order of magnitude more costly 
than parameter estimation. Some fast approximate methods have 
been used for evidence evaluation, such as treating the pos- 
terior as a multivariate Ga ussian centred at its peak (see e.g. 
iHobson & McLachlanll2003h . but this approximation is clearly a 
poor one for multimodal posteriors (except perhaps if one performs 
a separate Gaussian approximation at each mode). The Savage- 
Dickey density ratio has also been proposed (see e.g. lTrottall2007h 
as an exact, and potentially faster, means of evaluating evidences, 
but is restricted to the special case of nested hypotheses and a 
separable prior on the model parameters. Various alternative infor- 
mation crite ria for astrophysical model selection are discussed by 
iLiddld ( 120071) , but the evidence remains the prefer red method. 

The nested sampling approach, introduced by Skil lind d2004l) . 
is a Monte Carlo method targeted at the efficient calculation of the 
evidence, but also prod uces posterior inferenc es as a by-product. 
iFeroz & HobsorJ d2008l) and lFeroz et"al] d2009l) built on this nested 
sampling framework and have introduced the MultiNest algo- 
rithm which is very efficient in sampling from posteriors that may 
contain multiple modes and/or large (curving) degeneracies and 
also calculates the evidence. This technique has greatly reduced the 
computational cost of Bayesian parameter estimation and model se- 
lection, and is employed in this paper. 



3 WEAK LENSING BY TRIAXIAL GALAXY CLUSTERS 



3.1 Weak lensing background 



A cluster mass distribution is investigat ed using weak gravitationa l 
lensing through the relationship (see e.g. ISchramm&KavseJl 995 ) 
(e(sc)} = g(x), that is, at any point x on the sky, the local average 
of the complex ellipticities e = ei + ie2 of a collection of back- 
ground galaxy images is an unbiased estimator of the local complex 
reduced shear field, g = gi + ig2, due to the cluster. Adopting the 
thin-lens approximation, for a projected mass distribution £(as) in 
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the lens, the reduced shear g(x) is defined as 



9( x ) 



7(as) 



(4) 



where the convergence k(x) = E(a;)/E cr it and the shear 7(x) 
can, in general, be w ritten as a convolut ion integral over the conver- 
gence tt(x) (see e.g. lBridle et alJl99S). E cr it is the critical surface 
mass density 



Ecrit 



(5) 



4nG A As ' 

where D s , D\ and As are the angular-diameter distances between, 
respectively, the observer and each galaxy, the observer and the 
lens, and the lens and each galaxy. In general, the redshifts of each 
background galaxy can be different, but are assumed to be known. 
The lensing effect is weak or strong if k <C 1 or k > 1 respectively. 
The observed ellipticity e(x) can be c onverted to source elliptic ity 
(i.e. prior to lensing) e s (a;) as follows (Seitz & Schneider 1 99% : 



eJw±-gM /I I < 

l-9*(<B)e(a>) ** > 

l-g(x) £ *(x) /I I ,N 

e*(x)- 9 *(x) \\y\ > 1 ) 



(6) 



where an asterisk denotes complex conjugation. The inverse trans- 
formation can be obtained by swapping e s (a;) with e(x) and by 
replacing g(x) with —g(x). 

The number count of the background galaxies is changed by 
lensing due to two competing effects: (a) some faint sources in 
highly magnified regions are made brighter and pushed over the 
flux limit, but (b) the same sources are stretched across the sky re- 
ducing the number count of these sources. The observed number 
density of sources, n is related to the actual background number 
density no by the flux limit a through n = no/i" -1 , where u is the 
lensing magnification defined by p -1 = (1 — k) 2 — \ j\ 2 . 



3.2 Triaxial NFW model 

A model cluster density profile can be determined from numerical 
TV-body simulations of large-scale struct ure formation in a A CDM 
universe. In particular the NFW profile dNavarro et al. 1997) pro- 
vides a good fit to the simulations. Assuming spherical symmetry, 
the NFW profile is given by: 



p(r) 



{r/r s )(l + r/r E ) 2 ' 



(7) 



where r s and p s are the radius and density at which the logarithmic 
slope breaks from —1 to —3. The extension to a triaxial cluster is 
performed simply by defining the triaxial radius r, such that 



y*2 y2 ^2 

r2 = — + -gr + — > (a<&<c = i), 



(8) 



where the coordinates X, Y and Z lie along the principal axes of 
the cluster, and a, b and c are the semi-minor, semi-intermediate 
and semi-major axes respectively of the iso-density ellipsoid r — 1. 
We define M200 as the mass contained within the triaxial radius 
r = r2oo at which the density is 200 times the cosmological crit- 
ical density p cr it at the redshift of the cluster. The concentration 
parameter C is a measure of the halo concentration and is defined 

as C = r<ioo/r s . 

Following IOgurietalJ ( f2003h . we define the orientation angles 
(8, 4>) such that they represent the polar angles of the line-of-sight 
of the observer, as measured in the principal coordinate system of 
the triaxial cluster. We set the semi-major axis c = 1 and therefore 



the parameters a and b serve as the minor:major and intermedi- 
ate:major axis ratios respectively. Thus, for a triaxial NFW, cluster 
parameters are © = (xq, yo, a, b, 9, (j>, M200, C, z), where xo and 
1/0 are the spatial coordinates in the observers a;j/-plane at which 
the cluster is centred, and z is its redshift. 

The projected surface mass density of a triaxial cluster will 
have elliptical symmetry and by translating the observer's xy-p\ane 
such that the cluster centre is at the origin, and subsequently ro- 
tating the coordinate system by the position angle ip of the iso- 
surface-density ellipses, the elliptical radius £, in the resulting 
(x, y) coordinate system, can be written as 



(9) 



where q x and q v are the semi-major and semi-minor axes, respec- 
tively, of the ellipse f = 1, which corresponds to the projection of 
the ellipsoid r = 1. The position angle ip is given by 



^iarctanf-^ 
2 \A - C 



and the projected axis ratios read 
2/ 



Qx = 



2 
Qy 



where 



A + C - yj (A - Cf + B 2 
2/ 



A + C+ J(A-cy +B 2 



c 2 c 2 
/ = sin 6 [ — 7t cos 6 + — r sin 1 



+ cos 



(10) 



(11) 



(12) 



(13) 



A 



B 



' 2 

2 a I C ■ 2 , c 2 
cos I — sin (f> + — r cos ( 

a 2 b 2 



a 2 b 2 



cos 6 sin 2(f) 



a 2 b 2 



r 2 r 2 

n 2 y 1 2 

C = —sin </H — - cos 1 
b 2 a 2 



(15) 



(16) 



Note that qx ^ §y for the given rotation angle tjj. We define the 
axis ratio q as q = qy /qx, which defines the ellipticities of the 
projected iso-density contours of the triaxial cluster. 

The lensing shear fields 71 (i, y) and 72(18, y), and conver- 
gence y) in the (x, y) coordinate system can be calculated as 
the second derivatives of the lensing potential y) = &(x, y) 
as follows (with commas indicating differentiation): 



72 = <f>,xy 
1 / = 



(17) 
(18) 
(19) 



In order to apply the method of lSchrarmr] dl990f) and lKeetonl < |200 lh 
to evaluate these lensing formulae, it is necessary, however, to per- 
form a further coordinate scaling x = x/q x , y = y/qx such that 
the elliptical radius can be written as 



=2 

t 2 =2 . V 
C = X +—, 

q 2 



(20) 



and so the semi-major axis of the ellipse £ = 1 is equal to unity. 
In the (x, y) coordinate system, one can now express the sec- 
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ond derivatives of the lensing potential ${x, y) as follow^} 

$,ag(2,y) = qx 2 K (x,^) + qj (x,y), (21) 
*,S«(5,f/) = qy 2 K 2 (x,y) + qJi(x y y), (22) 
= giyA"i(f,f), (23) 

where 

u«'(C(u)) 



K n (x,y) 



J n (x,y) = 



o CM[i-(i-g 2 H' 
«(C(«)) 

[1- (l-g 2 )u] n+1/2 



hl/2 



du, 



du, 



in which 



and 



C (u) 2 = u ( x 2 + 



(l-g 2 )« 



d( (u) 



(24) 



(25) 



(26) 



(27) 



The surface mass density n (f) in the above equations can be ex- 
pressed in terms of the spherical surface mass density k 3 (£) (see 
iBartelmannl 1996] for the expressi on for k s (£) for the NFW profile) 
as follows (see lOguri et alj |2003 for the derivation): 



«(0 



MO 
V7 ' 



(28) 



where / is given in Eq. l !13t . 

The shear and convergence fields in the (x, y) coordinates, 
given in Eqs. ( I17b -(ll9t are straigtforwardly calculated from the 
derivatives of the lensing potentials Eqs. d2 1 b — J23t using the fact 
that &.xx(x,y) = &,Sx(x,y), and similarly for the other terms 
(since the gravitational potential has a scaling dimension of 2). Fi- 
nally, the convergence and shear values in the observer's original 
(x, y) coordinate system (shifted such that the cluster centre is at 
the origin) are obtained by rotating the shear vector by angle ip, 
given in Eq. (10), as followfl 



n(x,y) = R(x,y), 

lx{x,y) = \j(x,y)\cos(2(a + tp)) 

72(x,y) = \j(x,y)\aux(2(a + ij})), 

where 



and 



Q = I arctan (54Ml 

2 \ii[x,y) 



(29) 
(30) 
(31) 



(32) 



(33) 



We have tested the triaxial NFW lensing equations given 



1 The expression for the integral J n (X, Y) in Sec. 4 of lKeetonl fcOOll) 
should have k{C, (u)) instead of k(£(u) 2 ), in the n umerator. 

2 Although the descriptio n of triaxial lensing in [Cprless & King 120071) 
an d lCorless & King] (2008) makes no mention of the successive coordinate 
rescaling and rotation discussed above, ICorlessI (2008) does mention that 
all triaxial projections are rotated by their position angle tfi before lensing 
integrals are calculated and that the resulting shear and convergence values 
are rotated back to the original orientation of the lens. Also, Eq. (A18) in 
ICorless & Ki ng 12008) should not have qx in the denominator. However, 
the correct form of equations were used in their analysis code (King, private 
communication) . 



di 



above by numerically integrating the mass density of a given tri- 
axial cluster, along the line-of-sight to calculate the surface mass 
density k(x, y) which was found to be equal to the k(x, y) calcu- 
lated by using Eqs. ( |19b and ( 128b . 

3.3 Weak lensing likelihood 

The observed complex ellipticity components of the iV ga i back- 
ground galaxies can be ordered into a data vector d with compo- 
nents 

(l < A^gal) 

(34) 

(A^ai + 1 < i < 27Y gal ) 

Likewise the corresponding components of the complex reduced 
shear g(xi) at each galaxy position, as predicted by the cluster 
model, can be arranged into the predicted data vector d F , with the 
arrangement of components matching d34b - 

The uncertainty on the components (i.e. real and imaginary 
parts) of the measured ellipticity e consists of two contributions. 
The components of the intrinsic ellipticity, e s , of the background 
galaxies (i.e. prior to lensing) may be taken as having been drawn 
independently from a Gaussian distribution with mean zero and 
variance af nt . If we denote distribution of e s by Pi n t(e s ), the dis- 
tribution of the lensed ellipticities p ei (ee) is then given by: 




Pe t {u) = PintOs) 



dee 



(35) 



where we have taken account of e s and both being complex vari- 
ables. The Jacobian determinant can be found using Eq.l(6j and is 
given by: 



i-|g|- 



de e 



71 (IffKl) 



(36) 



1 9 ' 



\g\ > i) 



The second source of uncertainty on the measured ellipticity is 
due to the errors introduced by the galaxy shape estimation proce- 
dure, such that e = ei + ZS.ee, where the components of Aee can be 
modelled as drawn from a Gaussian with mean zero and variance 
a 2 bs . Denoting this distribution by p bs(Ae^), the probability dis- 
tribution for the measured ellipticities can be obtained as follows: 



p.00 



Pe e (ee)p bs(e — ee)dee- 



(37) 



If the reduced shear is small i.e. for \g\ <^ 1, the Jacobian 
determinant in Eq. l |36b reduces to unity and we have pe e {ee) = 
Pint(e s ). This reduces Eq. < |37 b to a convolution of two Gaussians 
with means zero and standard deviations (Tint and a bs respectively. 
Consequently p e (e) is a Gaussian with mean zero and standard de- 
viation <Ti corresponding to the ith galaxy, given by: 

(38) 



2 2 
°"obs + °"in1 



This leads to a diagonal noise 
components. This is however, 
expected reduced shear from 
this study is not to o large, we 
As shown by iMarshall 
likelihood function as C(&) 
usual misfit statistic 



covariance matrix C on the ellipticity 
not the case when \g\ ~ 1. Since the 
the low redshift simulated clusters in 
adop t this a pproximation. 
et al.l (|2003), we can then write the 
1 • ~ ^ where \ 2 ls me 



= ^ 1 exp(- 



2X 



(d-d 1 



) T c-\d- 



N Z.*1 2 . . ,. 2 

^ _ ( e i,i ~ 9j(Xi)) 



EE- 

i=l 3=1 



(39) 
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Parameters 


Flat Priors 


Bett Priors 


Spherical Priors 


xo 


— 12 ^ xo $ 12 


— 12 $ xo ^ 12 


— 12 ^ xo $ 12 


yo 


-12 ^ ?/o < 12 


— 12 $ yo < 12 


-12 sg yo < 12 


a/0 


U.Z 5; dj ^ l.U 


U.oUU 1 U. Izo 


a/o — 1 


b 


0.2 ^ o ^ 1.0 


0.85 ± 0.10 


6 = 1 


0/rad 


< sin(e) s: 1 


s£ sin(0) sC 1 


= 


0/rad 


^ <j> < 2tt 


^ <t> s£ 2tt 


= 


M20o/h- 1 M Q 


14.0 < log 10 (M 20 o) < 16 


14.0 s£ log 10 (M 20 o) 16 


14.0 ^ log 10 (M 2 oo) < 16 


C 


< C < 15 


sC C < 15 


< C ^ 15 


z 


z = 0.2 


2 = 0.2 


2 = 0.2 



Table 1. Priors for the cluster parameters. Inequalities denote uniform prior probability between the given limits, equalities mean delta function priors, and 
(a ± ft) denotes a Gaussian prior with mean a and variance ft 2 . 



and the normalisation factor is Z L = (27r) 2Jv s.-»i / 2 1 C| 1/2 . Note that 
is it necessary to include this normalisation factor in the likelihood, 
since the covariance matrix C is not constant, but depends on the 
cluster model parameters through the predicted shear terms in J38l >. 

3.4 Priors on cluster parameters 

To determine the model completely it only remains to 
specify the prior 7r(©) on the cluster parameters © = 
(xo, yo, a, b, 9, ij>, M200, C, z). The choice of prior is particularly 
important for weak lensing analysis since the problem is inher- 
ently underconstrained and therefore any prior information avail- 
able about the cluster parameter is extremely useful. One should 
however be careful in the choice of priors not to impose too strong 
assumptions which may lead to erroneous inferences. 

We use three different priors which we call flat, Bett and spher- 
ical. These priors are specified in Tab. Q] In all 3 cases, we as- 
sume that cluster redshift is known. It should also be noted that 
although the orientation angles are defined over < 9 < n and 
< <j> < 2ir, because of the elliptical nature of the projected 
density contours, unique lensing profiles are possible only over the 
range < 9 < n/2 and < <j) < n. Apart from the spherical case, 
the prior on 9 is uniform in sin(f?). 

The most restrictive case is that of the spherical priors, which 
assume the axis ratios to be fixed a — b — 1, and hence correspond 
to a spherical cluster model. This is clearly a very strong assump- 
tion and there is a debate in the literature whether this assumption 
is behind the deviation s from ACDM of the o b served gravitationa l 
lensing results (see e.g. ICorless & Kin3 ( l2007h : lo"guri et alJd2005h : 
IGavazzil d2005l) ). The least restrictive case is given by the flat pri- 
ors, for which assume a uniform prior distribution on the axis ratios 
with 0.2 ^ 6 ^ 1.0 and 0.2 ^ a/b ^ 1.0. An intermediate choice 
is to use the distribution of axis ratios derived from A^-body simula- 
tions as the prior o n axis ratios. We therefo re also consider the Bett 
priors specified in ICorless & Kin3 J2008h . for which the distribu- 
tionof axis ratios is taken from A^-body simulations bv lBett et al.1 
( 120071) . Bett priors assume a Gaussian distribution on the axis ratios 
a/b and 6 with means and standard deviations given in Tab.Q] 



4 APPLICATION TO SIMULATED OBSERVATIONS 
4.1 Simulations 

Following ICorless & K ing (2008), we simulate prolate (a — b = 
0.4, i.e. 'cigar-shaped') and oblate (a — 0.4, 6=1, i.e. 'pancake- 
shaped') clusters in two orientations: (a) Line-of-sight (LoS): with 



the 'odd' axis along the line of sight (9 = for prolate and 9 = 
7r/2, = for oblate), and (b) Plane: with the 'odd' axis in the 
plane of the sky (9 = 7r/2 for prolate and 9 = 4> = for oblate). 
LoS orientation results in spherical projected iso-density contours 
for our chosen prolate and oblate clusters. In Plane orientation, the 
projected iso-density contours are elliptical with a position angle ij) 
of 7r/2 and 0, respectively, for prolate and oblate clusters. We also 
simulate a spherically symmetric cluster (a = b = 1) to determine 
the effects of fitting an (overly complicated) triaxial model to it. 

All the simulated clusters were assumed to have the NFW 
density profile as given in Eq. (0, with xo = 0, yo = 0, 
M200 = W 15 h~ 1 M Q , C = 4 and z = 0.2. The simulations 
were carried out for a field 12' in radius, with background source 
density no = 30/arcmin , typical of ground-based observations. 
We assume all the background galaxies lie at redshift 2 = 1, 
this is justified by the low redshift (2 = 0.2) of the simulated 
lens. The galaxies are positioned randomly in the field with their 
shapes drawn from a Gaussian distribution with standard deviation 
<T e = <7; 2 nt + cr 2 bs , where ajnt and <j b s are the intrinsic and ob- 
servational dispersion of galaxy shapes as discussed in Sec. 13. 31 We 
set o"i n t =0.2 and a b s = 0.1. These galaxies are then lensed by 
the cluster as described in Sec. 13. II The number density of back- 
ground galaxies is then red uced according to n = np/x°~ with 
the flu x limit a = 0.5, as in lFort et"al] d 1997t> and lCorless & Kirigl 
(2008). Throughout we assume a concordance ACDM cosmology 
(£2 m , = 0.3, «a,o = 0.7 and h = 0.7). 



4.2 Analysis and Results 

Fig. Q] shows the posterior probability distributions, marginalized 
in the i\f2oo-C* plane, obtained using the different priors discussed 
in Sec. 13.41 for fitting a triaxial NFW model to the triaxial cluster 
simulations discussed in Sec. 14.11 We also analyse our simulated 
data-sets assuming the spherical model and include the r esults in 
Fig. Q] To emulate the analysis of ICorless & Kind d2007l) . for the 
spherical model we further consider the case where one removes 
the galaxies within 1' of the cluster centre to avoid the strong lens- 
ing regime and only simulate galaxies out to 7.5' from the cluster 
centre. The results for fitting a triaxial NFW model to a spherical 
cluster are shown in Fig. [2] 

As expected, in all cases the flat priors result in broadest pos- 
terior distributions, while the spherical priors give the most con- 
strained posteriors. For the flat priors, we see from Fig.[T]that each 
posterior distribution contains the true parameter values with the 
95 per cent confidence region, except for the prolate LoS cluster 
which has significantly overestimated values for both M200 and 
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Figure 1. 2-D marginalized posterior probability distributions in the M20Q-C plane for (a) prolate LoS cluster, (b) prolate plane cluster, (c) oblate LoS cluster 
and (d) oblate plane cluster. The contours delineate the 68 and 95 per cent confidence regions, respectively, for the different assumed priors. 



C. In LoS orientation, the surface mass density of the cluster has 
spherical symmetry and, coupled with the thin lens approximation, 
one would expect the axis ratios and orientation angles to be largely 
unconstrained by the observed galaxy ellipticities. Therefore, the 
inferences on axis ratios a, b and orientation angles 8, <f> for our 
LoS prolate and oblate clusters are expected to be driven by their 
prior distributions. The overestimation of A/200 and C for the pro- 
late LoS cluster occurs because it has 8 = and a/b — 1, which is 
highly disfavoured by the prior distributions employed. In the ab- 
sence of strong constraints from the data, the priors thus pull these 
parameters away from their true values. The consequent inaccura- 
cies in the estimates of 9 and a/b result in an overestimation of 
M200 and C. We found that fixing the axis ratios a and b or the 
orientation angle 8 to their true values removes the bias in the esti- 
mation of A/200 and C . The oblate LoS cluster has 8 = n/2, (f> = 
which is not disfavoured by the prior distribution and therefore we 



do not see a significant bias in the posterior distributions. For the 
prolate plane and oblate LoS orientations, it is worth noting that 
the degeneracy directions of the posteriors with flat priors are al- 
most o rthogonal to those obtained in these cases bv lCorless & King 
( 2008) which may be due to the wider priors we used on the axis 
ratios, whereas the shapes of the posteriors agree well in the other 
two cases. 



As the priors become more restrictive, the true parameter val- 
ues do begin to fall noticeably outside the two-dimensional 95 per 
cent confidence region for all cases, except the oblate plane cluster. 
Even for the spherical priors, however, one obtains largely unbiased 
estimates for C, except in the prolate LoS orientation for which 
C is significantly overestimated (as explained above). Turning to 
A/200 derived from spherical priors, we see that it is significantly 
overestimated in the prolate LoS orientation, noticeably underesti- 
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± 


0.3 


163.4 


± 
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0.3 


Oblate LoS 


111.8 


± 


0.3 


112.5 


± 


0.3 


107.6 ± 


0.3 


Oblate Plane 


406.1 


± 


0.3 


346.7 


± 


0.2 


349. 8 


± 


0.3 


Spherical 


280.3 


± 


0.2 


280.6 


± 


0.2 


273.C 


± 


0.3 



Table 2. log Z values for fitting prolate LoS, prolate plane, oblate LoS, 
oblate plane and spherical clusters with triaxial (flat priors), one spherical 
cluster and two spherical cluster models. 



study. It should also be noted that the single spherical cluster model 
is favoured in the analysis of the spherical NFW cluster simulation. 



0.2 0.4 0.6 0.8 1 . 1.2 1.4 1.6 1.8 2 
M 2 oo^ M s u „ K10 « 

Figure 2. 2-D marginalized posterior probability distributions in the M20Q- 
C plane for a spherical cluster. 



mated in the oblate LoS case, slightly underestimated in the prolate 
plane orientation, and largely unbiased for the oblate plane cluster. 

The most notable inaccuracies in fitting a spherical NFW 
model are observed when we allow for more than one spherical 
cluster in the field. Bayesian model selection can then be used to 
determine whether multiple spherical clusters provide a better fit 
to the data. Table [2] lists the Bayesian evidences calculated by our 
cluster finding algorithm for one triaxial NFW cluster, one spher- 
ical NFW cluster and two spherical NFW clusters. As expected, 
one triaxial NFW cluster model is preferred for the triaxial clus- 
ter simulations with their odd axis in the plane of the sky, which 
results in the surface mass density having elliptical contours. LoS 
orientation, however, results in spherical projected iso-density con- 
tours, as discussed in Sec. 14. II and since the prior on 9 completely 
rules out the true value for the prolate LoS cluster, there is weak 
preference for the spherical model over a triaxial cluster model. 
There is no such mismatch between the true parameter values and 
the corresponding prior distributions for the oblate LoS cluster and 
therefore the Bayesian evidence does not prefer either the spherical 
or the triaxial model over the other. 

In the case of the triaxial cluster simulations with odd axis 
in the plane of the sky, and hence elliptical iso-density contours, 
one would expect multiple spherical clusters to provide a better fit 
than just a single spherical cluster. This is confirmed by the evi- 
dence values as listed in Table [2] In fact, we fitted up to 4 spherical 
clusters for both prolate and oblate plane clusters and found the 
evidence value to be still increasing. We show the posterior distri- 
butions for the spatial coordinates at which the cluster is centred for 
the 1-spherical and 4-spherical model fits to the prolate and oblate 
plane clusters in Fig. [3] One can see multiple clusters aligned along 
the i-axis and j/-axis, respectively, for the oblate and prolate plane 
clusters, which corresponds to the position angle of the projected 
iso-density contours in each case. It is clear from this figure that 
by fitting spherical NFW models to highly triaxial simulations, one 
would incorrectly infer the presence of multiple clusters. This is 
perhaps the biggest drawback of fitting spherical NFW models to 
intrinsically triaxial clusters and the most important result of this 



5 CONCLUSIONS 

We have presented the weak lensing equations for the triaxial NFW 
profile. We also presented a method to estimate the cluster param- 
eters assuming a triaxial NFW model. By considering four highly 
triaxial NFW galaxy clusters, we found that, spherical assumptions 
resulted in significant biases in parameter estimates for the cluster 
concentration C only for the prolate cluster in line-of-sight orien- 
tation. The cluster mass M200 was significantly biased, however, 
being overestimated for triaxial clusters oriented such that there is 
a lot of mass along the line-of-sight and underestimated for triax- 
ial clusters with very little mass along the line-of-sight. The ten- 
sion between the concentration C ~ 4 of clusters predicted in the 
ACDM model and the high values C ~ 8 estimated in some clus- 
ters when assuming a spherical model can therefore be explained if 
these clusters have very extreme axis ratios and oriented such that 
there is a lot of mass along the line-of-sight. Even more worryingly, 
the spherical model can lead to multiple cluster detections when ap- 
plied to highly triaxial clusters with their odd axis in the plane of 
the sky. It is therefore important to be cautious in drawing conclu- 
sions about presence of substructure in weak gravitational lensing 
cluster studies in which a spherical cluster model has been used. 

Even though our analysis algorithm is very efficient, fitting for 
triaxial cluster still takes a few days because, unlike the spherical 
case, the shear equations for triaxial NFW described in Sec. 13.21 
do not have an analytical solution and need to be solved numeri- 
cally. This makes the application of triaxial NFW model for search- 
ing large survey fi elds along the lines of the method presented in 
iFeroz et all d2008l) unfeasible. In a future study, we plan to train an 
artificial neural network for solving these triaxial NFW equations 
and apply the resultant method on survey fields. 

Another important area for studying cluster physics is through 
coherent analysis of weak/strong lensing, optical, X-ray and 
Sunyaev-Zel'dovich (SZ) observations. As we can see, analysis us- 
ing weak lensing data only results in large uncertainties in cluster 
parameters due to inherent underconstrained nature of the problem. 
It is therefore important to perform a joint analysis using several 
different da ta-sets in order to ob tain better constraints on cluster 
parameters. Marshall et al. (2003) presented a method for the joint 
a nalysis of lensing an d S Z observations which was fur ther extended 
in lFeroz et af] d2009h and iHurlev- Walker et all J201lh . These stud- 
ies assumed a spherical NFW model. We plan to extend this to a 
triaxial NFW model and incorporate the X-ray observations as well 
in a future work. 
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Figure 3. 2-D marginalized posterior probability distributions in the xo-yo plane for (a) prolate plane cluster, (b) oblate plane cluster, when analysed assuming 
1 and 4 spherical cluster models. 
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